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Abstract — The problem of random number generation dates 
back to von Neumann's work in 1951. Since then, many algo- 
rithms have been developed for generating unbiased bits from 
complex correlated sources as well as for generating arbitrary 
distributions from unbiased bits. An equally interesting, but less 
studied aspect is the structural component of random number 
generation as opposed to the algorithmic aspect. That is, given 
a network structure imposed by nature or physical devices, 
how can we build networks that generate arbitrary probability 
distributions in an optimal way? 

In this paper, we study the generation of arbitrary probability 
distributions in multivalued relay circuits, a generalization in 
which relays can take on any of N states and the logical 
'and' and 'or' are replaced with 'min' and 'max' respectively. 
Previous work was done on two-state relays. We generalize these 
results, describing a duality property and networks that generate 
arbitrary rational probability distributions. We prove that these 
networks are robust to errors and design a universal probability 
generator which takes input bits and outputs arbitrary binary 
probability distributions. 



I. Introduction 



A. Motivation 



Many biological systems involve stochasticity. Examples 
of these include gene expression[4|, chemical reactions|5|, 
and neuron signaling|6|. However, despite the stochasticity, 
often deemed as noise, they are still capable of achieving 
functionalities that artificial systems cannot yet compete with. 

Motivated by the idea that stochasticity is an important en- 
abler of biological computation, we tackle a simpler question 
as a stepping stone: Can we design networks that generate 
stochasticity in a systematic way? 

B. Structural Aspects of Random Number Generation 

This question is strongly connected to an important thread 
of work in computer science on random number generation. In 
1951, Von Neumann [ 1 1 studied this problem in the context of 
generating a fair coin toss from a biased coin. Knuth and Yao 
J21 studied the reverse problem of generating arbitrary distri- 
butions from unbiased bits (fair coins), which was extended by 
Han and Hoshi [3] to generating arbitrary distributions from 
biased distributions. These primarily focus on an algorithmic 
perspective to random number generation. 

In a biological system (or other physical device), we do 
not have the full generality of the algorithmic approach to 



generating probability distributions. Randomness arises in spe- 
cific areas that can only propagate according to the structure 
of how components are composed in the given network. The 
study of random number generation under these constraints 
can be greatly beneficial to understanding natural stochastic 
systems as well as to designing devices for random number 
generation. In this paper, we will analyze random number 
generation in the context of multivalued relay circuits, a 
generalization of standard relay circuits to any number of 
states. This generalization is very natural and has an intuitive 
understanding as the timing of events that have dependencies 
on other events. 

C. Deterministic Relays 

We will start by introducing deterministic relays. A deter- 
ministic relay switch is a 2 terminal object which can be 
connected (closed) by a wire or left open. The state of the 
switch, which can be either or 1, describes an open or 
closed state respectively (see Figure [T^). These states are 
complements of each other. That is, if switch x is in state 
0, then the complement x is in state 1 and vice versa. When 
multiple switches are composed together, these networks are 
known as relay circuits. One of these, a series composition, 
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Fig. 1. Deterministic Examples. For two states, xy is the boolean 'and' 
of x and y while x + y is the boolean 'or' . Figures b and c also apply to 
multivalued relays where xy is min(x,y) and x + y is max(x,y). (a) A 
relay can either be opened (state 0) or closed (state 1). (b) In series, the entire 
circuit can only be closed if both x and y are closed. In parallel, the entire 
circuit will be closed if either x or y are closed, (c) In this example of a 
non-sp circuit, we take the boolean 'and' along the four possible paths, and 
then apply the boolean 'or' to those subsequent values. 



is formed when two switches are put end to end. A parallel 
composition is formed when two switches are composed so 
that the beginning terminals are connected together and the 
end terminals are connected together. 

Shannon showed that the series and parallel compositions 
can be represented by the boolean 'and' and 'or' operations[8|. 
If switches x and y are composed in series to form z\, then 
z\ will only be closed if both x and y are closed. On the other 
hand, if switches x and y are composed in parallel to form 
22, then Z2 will be closed if either x or y are closed. We will 
denote the series composition of x and y by x * y or simply xy 
and the parallel composition by x + y (see Figure [lj>). This 
notation will be preserved for further generalizations of the 
relay circuits. 

Circuits formed solely by series and parallel compositions 
are called sp circuits. Shannon showed that non sp circuits 
could also be represented by boolean operations. The general 
mathematical representation of any relay circuits would be to 
find all paths that go from the beginning to the end terminal. 
For each path, we take the boolean 'and' of all switches along 
that path; then we take the boolean 'or' of the values derived 
for each path (see Figure [TJ;). 

D. Stochastic Relays 

Recently, Wilhelm and Bruck introduced the notion of a 
stochastic relay circuit||9). These circuits are a generalization 
of Shannon's relay circuits; instead of having deterministic 
relay switches that are in either the open or closed state, 
stochastic relay circuits can exist in either state with a specified 
probability. If a stochastic relay switch x, called a pswitch, has 
probability p of being closed and probability 1 — p of being 
open, this distribution is represented by a vector v = (l—p,p) 
where v.- L corresponds to the probability of x being in state 
i. We say that x realizes (1 — p,p) or simply x realizes 
p. If pswitches x and y, which realize probabilities p and 
q respectively, are composed in series, the new composition 
will realize pq. If they are composed in parallel, the new 
composition will realize p + q — pq (see Figure [2}. 

One of the primary questions dealt with in their work was 
the generation of probability distributions using a limited num- 
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Fig. 2. Simple Examples. As stated, 



i is short for i). (a) A single 

i \ 

2 -pswitch. (b) Putting (a) in parallel with a ^-pswitch gives f. (c) Putting 
(b) in series with a ^-pswitch gives §, (d) Putting (c) in parallel with a 
2 -pswitch gives ±g. 



ber of base switch types, known as a switch set. For example, 
if the switch set S = {^}, then relay circuits built with this 
switch set can only use pswitches with the distribution (|, |). 
They proved that using the switch set S = {^}, all probability 
distributions ^ could be realized with at most n pswitches. 
Continuing, many more results were proved not only in real- 
izing other probability distributions [ 9 ] [ 1 1 , but also in circuit 
constructions such as a Universal Probability Generatorl 9 1 and 
in robustness lfm and expressibility[ 10] properties of these 
circuits. 

E. Multivalued Stochastic Relays 

In order to study non-bernoulli random variables, it is 
necessary to generalize Shannon's relays to a larger number of 
states. Multivalued logics have been studied as early as in 1921 
by Post lTTSl and followed up on by Webb[13| and others! 14 1. 
The work presented in this paper concerns one generalization 
of two-state relay circuits to multivalued relay circuits where 
we use two multivalued functions (gates). 

A multivalued switch is a relay switch that can be in any 
of n states: 0, 1, 2, n-1. We define the complement of a 
switch to be n — 1 — i, where i is the state of the switch. Series 
and parallel compositions are redefined to 'min' and 'max', 
respectively, rather than the boolean 'and' and 'or'. This means 
that when switches x and y are composed in series, the overall 
circuit is in state min(x, y) and when they are composed in 
parallel, the overall circuit is in state max(a;, y) (see Figure 
[3^ and b). Non-sp circuits are also defined in a similar way 
to 2-state circuits. The general mathematical representation of 
any multivalued relay circuit is to find all paths that go from 
the beginning to the end terminal. For each path, we take the 
'min' of all switches along that path; then we take the 'max' 
of the values derived for each path (Figures [TJi and c still 
apply). 

One physical understanding of this max-min algebra is 
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Fig. 3. 3-state Examples. We use X to represent a deterministic switch and 
® to represent a pswitch. (a) A comparison of the truth tables for 2-state and 
3-state relay switches, (b) A simple 3-state deterministic example. Evaluating 
series connections using min and parallel connections as max, we find that 
this circuit is in state 1. (c) A simple 3-state stochastic example. On the top, 
we put a (i, 0, =) pswitch in parallel with a deterministic switch in state 1. 
Then, with „- probability we get max(0, 1) and with „- probability we get 
max(2, 1), which yields a (0, circuit. Similarly, we can calculate the 

bottom circuit to realize distribution (|. j, i). 



found in the timing of relay switches. Let a switch start off in 
the closed position. Then the state of the switch will be the 
time t E {0, 1, n — 1} when the switch opens. Compose 
two switches, which open at time t\ and in series. If 
either switch opens, then the overall connection is broken. 
Therefore, the time that the series composition will be open 
is min(ti, t-i). In the same way, we can compose the switches 
in parallel. Since both switches need to be opened in order 
for the overall circuit to be open, the time calculated for the 
parallel composition is mBx{t\,t<i). More generally, the max- 
min algebra can be used to understand the timing of events 
when an event occurence depends on the occurence of other 
events, i.e. event A occurs if event B occurs or if event C and 
D occur. Max-min functions have been studied as a way of 
reasoning about discrete event systems|15|. 

Now that we have defined multivalued relays, we can also 
reason about probability distributions over the n states of 
a multivalued stochastic relay. If a 3-state pswitch x has 
probability p of being in state 0, p\ of being in state 1, and 
P2 of being in state 2, we represent the distribution with the 
vector v = (po,pi,P2) an d sa Y x realizes (po,Pi,P2)- If the 
distribution is in the form (1 — p, 0, 0,p), we will shorten 
this to simply p if the number of states can be inferred from 
the context or if the equation holds for any number of states 
(see Figure [3]:). 

We present the following results in our paper: 

1) A duality property (Section II). 

2) Networks for generating binary probability distributions 
(Sections III and IV). 

3) Networks for generating rational distributions (Section 
V). 

4) Robustness of the previous networks to switch errors 
(Section VI). 

5) Universal Probability Generation (Section VII). 

6) Switching with partially ordered states (Section VIII). 

7) Applications to neural circuits and DNA Computing 
(Section IX). 

II. Duality 

It is important to characterize properties of multivalued 
circuits. One well-known property is duality, which plays a 
role in resistor networks, deterministic and two-state stochastic 
relay circuits. We show that a similar duality concept exists 
for multivalued circuits. Define the dual state of i as the state 
n — 1 — i; the dual distribution of v as the distribution v where 
Vi — v n -i-i\ the dual switch of x as the switch x that realizes 
the dual distribution of x; and the dual circuit of C as the 
circuit C that realizes the dual distribution of C. 

Theorem 1 (Duality Theorem): Given a stochastic series- 
parallel circuit C, we can construct C by replacing all the 
switches in C with their dual switches and by replacing series 
connections with parallel connections and vice versa, (see 
Figure |4j. 

Proof: This is shown using induction on series-parallel 
connections. 
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Fig. 4. 3-state Duality Example. Remember that the 3-state = -pswitch is 
the shorthand for the (|, 0, |) pswitch. (a) A two element duality example. 
(b) A duality example on a larger circuit. Note that since the pswitches 
are symmetric distributions (duals of themselves), only the compositions are 
changed. 



Base Case: The dual of a single pswitch with distribution 

{Po,Pi, -,Pn-x) is (Pn-i, -,Pi,Po), which trivially satisfies 
the theorem. 

Inductive Step: Suppose a circuit C with distribu- 
tion (po>f>i> ■••>Pn-i) an d a circuit C with distribution 
(qo, qi, q n -i) satisfy the theorem, i.e. the distribution 
of C is (Pn-i, ...,pi,po) and the distribution of C' is 
(<3Vi-i, <2i, qo). To prove the theorem, it is sufficient for us 
to show that CC is the dual of C + C'. 

Let C s = CC and C p = C + O . Then C s = 
(c , ci, c„_i) and C p = (d , d 1: where 

c k = p < c = ^ p < c ' = ft 

min(i,j)—k 

= 

d k = Pr( - C = l ) Pr ^ C ' = 3) 

max(i^j)—k 

= Pn-l-iQn-l-j 
max{i 1 j)—k 

Pn-l-iQn-l-j 

min(n—l—i,n—l—j)=n—l—k 

Y Pi'Qj' 
min(i' ,j')—n — l — k 

= c n-l-k- 

We see that d = c„_i, d\ = c„_ 2 , d n -i = c , 
demonstrating that C s is the dual of C p . ■ 

III. Realizing Binary 3-state Distributions 

We can now ask questions about generating probability 
distributions with stochastic relay circuits. These include: 
What are the possible distributions that can be realized? What 
is the smallest set of basic switching elements necessary to 
realize these distributions? Are there efficient constructions to 
realize these distributions? We will begin by demonstrating 
how to generate distributions on 3 states. 
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Fig. 5. Realizing 3-state Distribution, (a) We want to realize (|, ~, ~). 
This falls under the first case, (b) Now we have an unconstructed distribution 
that falls under the second case, fcj There are two unconstructed distributions. 
The top one falls under case 3 and the bottom falls under case 2. (Note that 
case 2 is equivalent to case 1 for po = \)- (d) The final circuit. 



In writing out algorithms and circuit constructions we will 
use the notation introduced earlier for series and parallel 
connections, i.e. xy and x + y will denote series and parallel 
connections respectively. We will use this notation loosely, i.e., 
(|, 0, |)(|, |,0) represents a circuit formed by composing a 
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) pswitch in series with a (4, i,0) pswitch. 
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Lemma 1 (3-state sp composition rules): Given 
P = {Po,Pi,P2) and q = (q a ,q 1 ,q 2 ), let z = pq 
y = p + q. Then, 
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Proof: The above expressions follow from enumerating 
all 3 2 switch combinations. ■ 
Theorem 2 (Binary 3-state Distributions): Using the 
switch set S = {i} and the deterministic switches, we can 
realize all 3-state distributions of the form ~, ^) using 
at most 2n — 1 pswitches with the following recursive circuit 
construction (see Figure [5] for an example): 



if p e S then 

return p 
else if ^ > I then 

return (§,(), §)(2=££ 
else if ^ > \ then 

return ( , ^ 
else if > \ then 

return (5^, ^hr, 
end if 
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Fig. 6. Block-Interval visualization of 3-state Algorithm. We want to 
realize ( | , 1 , | ) . A (g symbol marks where the cuts occur which correspond 
to the use of one pswitch. (a) We start with one interval split into blocks 
corresponding to the pswitch probabilities, (b) When we cut [0, 1] in half, we 
get two intervals; the blocks are cut and the probabilities change according to 
the new interval size, (c) The second application of the algorithm, (d) When 
we are left with single block-interval pairs, we know we are done since the 
deterministic switch is in our switch set. 



the 3 recursive cases enumerated above. We can verify that 
for each of these cases: 

1) The decompositions obey the 3-state composition rules. 

2) The switches are valid (non-negative probabilities and 
sum to 1) 

Since each algorithm's decomposition uses switches of type 
(5^") 2>r=r) 5?£t)> men we w iH eventually have n = 0, 
corresponding to a deterministic switch; at this point the 
algorithm terminates and has successfully constructed any 

/ _a_ Jj_ \ 
\2 n ' 2" ' 2™ /' 

We will now prove that we use at most 2n — 1 pswitches for 
all n > 1. Define /„ as the maximum number of pswitches 

2%). Then, 

(1) 

1) + !,/„_! + !) (2) 



used in the construction of any distribution ( ~ , 



/i = l 

/„ = max(/„_i 
= max(/ n _i 



l,2(n- 
l,2n- 



where ([T| is shown trivially, and |2]) is derived from the 3 
cases of the algorithm. Note that we are using a previous 
result[9| that 2 state distributions of form (^-, J^-) use at 
most n switches, and also assuming here that our algorithm 
generalizes the previous algorithm; that is, distributions of 
the form ( S; , Jr , 0) and its permutations also use at most 
n switches - this application of their 2-state proof to our 
multivalued distribution is sound as will be explained more 
rigorously in the following section. We are now left with a 
simple induction exercise. 

Base Case: /1 = 1 = 2(1) - 1. 

Inductive Step: Assume fk = 2k — 1. 



Proof: For any distribution p = (f>o>Pi>P2)> there exists 
some smallest k such that X)i=oP« > h' w ^ich correspond to So //. 



fk+i = max(/ fc + l,2(fc 4 
= max(2fc,2(fc + 1) 
= 2(fc + l) -1. 

2k -1 => / fc+1 =2(fc + l) 



Then, 
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Fig. 7. Realizing 4-state Distribution. We use the TV-state algorithm on 
the a 4-state distribution. A £3 symbol represents the pswitch (^,0,0, |). 
fa) We want to realize (|, |, j, g). Initially, po +Pi > \- (b) For the top 
distribution, po+Pi +P2 > § ■ For the bottom distribution, po +Pi > \ ■ (c) 
For the top distribution, po +Pi +P2 +P3 > 1. For the bottom distribution, 
PO + P1 + P2 > 5. (d) The final circuit. 



It is useful at this time to provide some intuition regarding 
the algorithm. We can view the original distribution as a series 
of blocks dividing up the interval [0, 1] (see Figure rol. By 
applying the algorithm, we are separating this larger interval 
into smaller intervals [0, |] and 1], cutting any blocks on 
that boundary. When this separation occurs, the total size of the 
block is decreased (namely, in half), and so the probabilities 
representing those intervals change - these probabilities are 
precisely those of the algorithm. Namely, we can rewrite the 
three cases of the algorithm: 
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Case 1 
Case 2 
Case 3 



(1,0,0) + (|,0,i)(2=^i, 3 A r , g^r) 

/ a 2"" 1 
V2"- 1 ' 2"- 
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l, 2 n-l , 2 "-i 



,0) + (i,0,i)(0,^_£, 

j^) + (s,o,i)(o,o 1 i) 



IV. Realizing Binary TV-state Distributions 

We are now ready to continue with our algorithm for N 
states. Intuitively, we can describe the algorithm for N states 
in the same way as for 3 states. We first find the smallest 
index k for which Yli=oPi > §■ Then based on the index 
k, we can decompose our distribution in a way corresponding 
to the interval-block visualization; the only difference is that 
each interval can now have up to N block-types instead of 
just 3. 

Theorem 3 (Binary N-state Distributions): Using the 
switch set S = {i} and the deterministic switches, we can 
realize all iV-state distributions of the form 
with at most f n ,M = 

j 2" - 1, n< \log 2 N] 

| 2 rio g2 wi _i + (iV_i)( n _ [l g 2 7Y]), n> riog 2 JV] 

switches, using the following recursive circuit construction 
(see Figure [7]): 



Algorithm 1 Note that we are using | as a shorthand for 
^) as explained in the introduction. 



if p e S then 

return p 
else if f£ > \ then 

return |( ao 2 ; 2 "i - 
else if ^i^i 
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return {^rr 
else if ... then 
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return (-^fl- -^i 



> | then 
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end if 
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The algorithm's correctness can be confirmed from the 
composition rules for N-states. Before proving the complexity 
of the above algorithm, we state an intermediate lemma: 

Lemma 2: Define an active state of a distribution as a state 
with non-zero probability. Given any distribution of the form 
(§£, fr, with m < N active states, the number of 
pswitches needed to realize the distribution is at most / n>OT . 

Proof: Re-map the m active states to 0,1,..., m — 1 in 
a way that preserves state order. Since all operations under 
max and min are preserved, any distribution on the original m 
active states can be constructed by constructing a distribution 
on the mapped states and reversing the mapping on the base 
switches. Therefore, the number of pswitches required cannot 
be greater than f n>m . This also gives us insight into /„,at for 
the region n < [log 2 N] . In this region, the number of active 
states is limited by n, resulting in states with zero probability. 
Then, it makes sense that the complexity only depends on n. 

m 

Proof: We will now prove the complexity of pswitches 
required. The following recursive relations hold for / n ,jv, 



fn,l 

fo,N 

fn,N 



= 
= 

= max(/„_ M + f n -i )N + 1, 

Jn-1,2 + /n-l,JV-l + lj 

fn~l,N + /n-1,1 + 1) 
= max (/ n _l,i + fn-l,N-i+l) + 1 

i<i<rfi 



(3) 
(4) 
(5) 



(6) 



where ([3| and Q are trivial since both define deterministic 
switches and |5]) comes from an application of the above 
lemma to the different cases of the algorithm. If we fill out 
a table based on the recursive relations (see Table I), we find 
that the values match our closed form hypothesis. To prove it, 
we will: 

1) Demonstrate that ^ is maximized at i = [=5-]. 

2) Use induction to prove f n ,N for n < \\og 2 N~\ . 

3) Use induction to prove f Ut N for n > [log 2 N~\ . 
Part 1: Define A„ jA r = f n ,N+i - fn,N- 
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TABLE I 

(ROW, COLUMN) VALUES CORRESPOND TO (N , n). THE n = |"log 2 JV] 
BORDER IS BOLDED. 



Part 3: Prove /„ jv for n > [log 2 N~\ . 
Base Cases: 

f(n, 1) = = (1 - l)(n - [log 2 1]) + 2^= H - l. 
f(\\og 2 N],N) =2 riog2An - 1 (from Part 1) 

=(7v-i)(rio g2 ivi _ rio g2 7vi) 



If n> riog 2 (JV + l)], 

A„ )JV = [(n - [log 2 (7V + l)D(JV + 1 - 1) + 2^( JV+1 )T] 
-[(n- riog 2 /V])(iY-l) + 2^ JV l] 

For N a multiple of 2, riog 2 (7V + 1)1 = |~log 2 N] + 1, so 

A„ )JV - [(n - [log 2 N] - l)N + 2^2 

- [(n- riog 2 /Vl)(/V-l)+ 2^=^] 
= -N + n- \log 2 N]+N 
= n - Tlog 2 JV] 
For iV not a multiple of 2, [log 2 (yY + 1)] = |~log 2 N], so 
A„ jiv = n - |~log 2 AT] 

If n< \\og 2 N}, 

A 7hN = 2" - 2" = 

So we have, 



,N 



n-pogjjiVl, n> riog 2 (iV + l)l 
0, n < \\og 2 N] 



Since A n> ^ is nonincreasing for a fixed n and increasing TV, 
the maximum of f n -x,i + fn-i,N-i+i + 1 is at the highest i 
for which i < N — i + 1. Otherwise, we could increment i, 
increasing the entire quantity by A„_i^ — A„_ijv-,: > 0. 
Therefore the maximizing index is i = 

Part 2: Prove f n ^ for n < [log 2 N~\ . 
Base Case: /(0, N) = = 2° - 1. 

Inductive Step: Assume fk-i,N' = 2 fe_1 - 1 VA/ 7 < AT and 
fc - 1 < [log 2 N'] . Then for k < \\og 2 Af| , 



/fe,iv = /fe-i,rfi + /, 



/fe-i,rfi + 4-i,Lf j+i + 1 

2(2 fc ~ 1 - 1) + 1 



(7) 



= 2 K + 1 
where (|7) is true because 

k < \log 2 N] =>■ 



AT. 



AT. 



i< [io g2 -i < riog 2 r 2 

l<[log 2 (LyJ+l)l 



Inductive Step: Assume fk-i.N' = (N — l)(fe — 1 — 
[log 2 AT'l ) +2^2 N'] _ i VAT''< AT and k - 1 > [log 2 AT'] . 
Then, 

fk,N = /fe-i,rf i + /fe_i,iv-rf i+i + 1 

= /fe-i,ff i + /fc-Uf J+i + 1 

= (a - i - rio g2 rf nxryi - 1) + 2^ 2 rf n _ i 



AT 



AT 



+ (fc-l-riog 2 (L y J+l)l)(LyJ+l-l) 

+ 2 r io g2(Lfj+i)i + i 

= (fc-i-ri og2 ^i)([^i-i) +2 ^2fi_ 1 

+ (k - 1 - [log 2 (LyJ + l)D(LyJ) + 2^ 2 (Lf J+Dl 

For A^ not a multiple of 2, riog 2 ([f J + 1)1 = [log 2 f]. In 
this case, 

= (fc — 1 — [log 2 yl)(["yl - 1) + 2 riofe ^ 

+ (fc-l-ri0g 2 ^l)(LyJ)+2^2 fl_l 

= (fe-i-(r(io g2 jv)-ii)(ryi + Lyj-i) 

+ 2 rio g2 fi+i _ j 

= (jfe- rio g2 A^l)(Ar- 1) + 2^^! _i 

For N a multiple of 2, [log 2 (Lf J + 1)1 = [log 2 f ] + 1. In 
this case, 

= (fc - 1 - [log 2 yl)(rf 1 - 1) + 2^ f 1 - 1 

+ (k - 1 - ([log, |l + l))(LyJ) + 2^2 f 1+1 

= ( fc -i-riog 2 ^i)(Ar-i)-L|j 
+ 2 ri°g 2 fi + 2 ri°g 2 fi+i _i 

= (fc-i-(ri og2 iVl-i))(Ar-i)- y + y 
+ 2 r(i°g 2 f )+ii _ i 

= (jfe - riog 2 N]){N - 1) + 2^ ^1 _ i 



V. Realizing Rational Distributions 

Given the previous results, a natural question arises: Can we 
also generate probability distributions over non-binary frac- 
tions? More generally, can we generate distributions for any 
rational distribution? This question was studied for the 2-state 
case by Wilhelm, Zhou, and Bruck|9][10| for distributions of 
the form (^-, -\). In their work, they demonstrated algorithms 
for realizing these distributions for any q that is a multiple 
of 2 or 3. In addition, they proved that for any prime q, no 
algorithm exists that can generate all (-%, 4r) using the switch 
set S = |i, «=i}. 



rl 2 

I g' q ' ' q 

We approach the question of realizing rational TV state 
distributions from two angles and demonstrate algorithms for 
each. The key for these algorithms lie in a generalization of 
the Block-Interval construction of binary distributions. 

Lemma 3 (Block-Interval Construction of Distributions): 
Let p — (po,pi, ...,pn-i) be any distribution. Then given any 
'cut', represented by the distribution (g, 0, ...,0, 1 — q), and 
the index of the cut k satisfying q > ^2i=a Pi> 1 — Yli=oPi' 



(Po,Pi, -,PN-l) 
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q PkA 



,o, ...) + (<?, ...,i-q)] 
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1 



q 



l-q '1-g' 

Proof: The equation follows directly from the max-min 
composition rules. ■ 
The name of the lemma comes from an intuitive representa- 
tion of this equation. We can let probability distributions be 
represented by an interval covered with blocks. Each block 
corresponds to a different state of the distribution (a block 
can only be on the rightside of a block whose state it is 
greater than) and the ratio of the block length to the interval 
length represents the probability of the state. Then any cut 
along the interval will produce two separate block-intervals 
corresponding to exactly the distributions calculated above. 
This means that for any equation of this form, the block- 
interval cut will hold. Conversely, any cut of a block-interval 
corresponds to an equation with the appropriate distributions 
substituted, (see Figure [8}. 

A. State Reduction 

Given the results already proved on two state distributions, 
one natural way to tackle N states is to first reduce the given 
iV-state distribution into compositions of 2-state distributions. 
Then, if algorithms already exist for those 2-state forms, we 
are done. 

Theorem 4 (State Reduction Algorithm): Using the switch 
set S = {^}, we can reduce any N-state distribution of 
XN ~ 1 ) into at most N — 1 two-state 

a I 



the form (^. 
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= State 1 111 = State 3 



Fig. 8. Block-Interval visualization (rational). We want to realize 
(po ) Pl , P2 , P3 ) ■ A symbol marks where the cut occurs, which corresponds 
to the use of one ps witch, (a) We start with one interval split into blocks 
corresponding to the pswitch probabilities, (b) When we cut with pswitch 
(q, 1 — q), the interval is separated into 2 intervals: [0, q] and [q, 1]. 



distributions of the form (..., ^t, ...) using at most 
/io g29 ,Ar = (AT-l)(log 2 g-riog2iVl) +2^^1-1 switches 
with the following algorithm - for an example, see Figure [9^: 



if p is a 2 state distribution then 

return p 
else if si > i then 



return K^ 3 ,^,. 
else if si±5l > i then 
return f^a, 2=^a,... - 

V q ' q ' 

else if ... then 



Mo, 



-) 



2x +2x 1 ~q 2x 2 



1 ) 

2 V. "i q > q ' ■■•) 



else if 

return ( 
end if 



x +xi + ...+a:jv-i 



9 

2xq 2a:i 



> \ then 

2x JV -_i-gs 



Proof: To prove this theorem, we need to show that 1) the 
breakdown of probabilities in each round is valid, 2) the algo- 
rithm will eventually terminate, and 3) the construction will 
have used f\ og2 q ^ pswitches, where / n ,jv is the complexity 
defined for binary TV-state distributions. 

Notice that choosing q = 2 gives us the exact same 
algorithm as the binary TV-state algorithm (except for the 
terminate clause). As with the binary case, Part 1 of the proof 
follows directly from the max-min composition rules. 

We will now argue that the algorithm terminates. Let k 
be the number of 'rounds' the algorithm takes. Using the 
block-interval representation, we see that after each round, any 
interval will either have < 2 blocks (states) or the interval size 
will be A-. Then when the interval size is less than each 

2 K q 7 

interval cannot have more than 3 blocks; otherwise, the middle 
block will be less than i, which is not possible except for a 
zero probability. Therefore, we have proved that the algorithm 
will eventually terminate. 

To prove the complexity, we need to calculate the number of 
'rounds' before termination of the algorithm. This will happen 



8 3 14 
27~'27 '27" 



1,27 ' 27 ' 27jJ 

I 9? — h 

I 07 -.r, 07 I 



r^)-n 



V 27 27 7 



-(o,i,o)— 



Fig. 9. Algorithm Illustrations, (a) An example of the state recursion 
algorithm, (b) An example of the denominator recursion algorithm. 



at, 



1 1 

¥ ~ a 



2 k > q 
k > log 2 q 



Since the algorithm is identical to the binary iV-state distribu- 
tion, we can use the same complexity bound. In this case, n 
corresponds to the fc-th round when the algorithm terminates. 
Therefore, we use at most (N — l)(log 2 q — |~log 2 iV]) + 
2 riog 2 Aq _ i pswitches. " " ■ 



B. Denominator Reduction 



The second approach to realizing rational iV-state distri- 
butions is to use a different switch set to directly reduces 
the power in the denominator. The intuition comes from a 
generalization of the block-interval construction. Rather than 
just cutting it at one point, i.e. at ^ for realizing binary 



distributions, we can cut it at the points 
get q equally sized intervals. 



1 2 



and 



g-i 



to 



Theorem 5 (Denominator Reduction Algorithm): Using S 
= i h, h , ^ , -} and the deterministic switches, we can 

L 2 ' 3 ' 4 9 

realize any distribution XN ~ 1 ) using at most (iV — 

l)(q - l)(n - [log 9 AH) +V log<lA ^ - 1 switches with the 
following algorithm - for an example, see Figure [9b: 



Define the following quantities for j = 0, 1, 2, q. 
f~. ■< — • • • • • • 



vj — the smallest index at which Y^iLo Pi > \ 



Rj ■ - 

if p e S then 

return p 
else 



return ( 



x 



., x £t-i ,Rq,0, ...) 



+ i(...,0,L 2 ,^,...,^,i? 2 ,0,...) 
+... 

+^-i(—,0,L q -x, ? T-i , ... , g T-i ,R q -i,0, ...) 



end if 



Proof: The idea of the construction is to perform q — 1 
iterations of the block-interval construction lemma for each 
reduction of a pswitch. WLOG, let the original interval have 
length 1. Then the first cut at 2=_ gives intervals of length 
2^ and i. The second cut at 2=2 of the 9=1 
intervals of length 1=11=^. = q ^ an( j v-° 



interval gives 
2=U- = i. The 



third cut leaves an interval of length 2-^ 2-4 2-1 = — and 

& 9 9-1 9-2 9 

another | interval. As this continues, we get q intervals of 
length ~. Our expression is exactly that of the block-interval 
construction lemma applied to the above description. Then 
since at each 'round' of these decompositions, we reduce the 
denominator by 1, we will eventually terminate when n = 0, 
which is a deterministic switch. 

The proof for the complexity is similar to the iV-state binary 
proof. If we let f q . n ,N be the maximum number of pswitches 
needed, then we have the following recursive relations: 



fq,Q,N 
fq,n,N 



max 

VHj ij=N+q- 



(9- l + y"/g,n-l,^) 

9-1 



One observation to note is that for q — 2, we get the 
same recursive functions as in the binary algorithm, which 
is expected: 

h,n.X = 
/2,0,Af = 
J2,n,N — 



max / 2 

ii,i%:i\+i2=N— 1 



1 



To go from this recursive form to the closed form version, 
we use similar methods to the binary TV-state complexity 
function, so we will only outline the steps: 

1) First prove that the recursive expression is maximized 
when all ij <G { , [yj + 1}- This is done by showing 
that the closed form for fq : n,N has a nonincreasing 
discrete derivative in N, i.e. f q , n ,N+i — fq,n,N is non- 
increasing as N increases. 

2) Next show that the recursive expression evaluated at 
the appropriate ij will achieve the closed form by 



breaking it up into cases to simplify the ceiling and floor 
expressions. 

■ 

Corollary 1: Let q = PxP-i—P^^x > where pi are primes. 
Then using S = {^,|,j,..., — ^ — }, we can realize any distri- 
bution (2a, ^, using at most £\ /p^fc.iv pswitches. 

Proof: Here we are simply applying denominator reduc- 
tion on each p i 4 . The complexity is just a summation of each 
step. ■ 

Corollary 2: Let q = PyP^ —Pnuix> where pi are primes. 
Then using S = — - — }, we can realize any 
distribution ( ^ , ^ , . . . , ) . P '"° X 

Proof: Our claim here is that we can reduce the switch set 
to the inverse of the primes up to p ma x- This is because using 
all of these switches, we can generate any other i, where k is 
composite. However, this means that the number of pswitches 
will go up, i.e. | will now use 2 switches |* |. We have not 
analyzed the complexity that this reduction in the switch set 
would result in. ■ 

VI. Robustness of Probability Generation 

The above algorithms looked at probability generation given 
a fixed switch set of distributions. However, in physical 
systems, it may be the case that the generation of randomness 
is error-prone. If we want to use a pswitch with distribution 
(po,Pi), the physical pswitch may actually have distribution 
(po + e ,Pi — e )- Define the error of such a pswitch as 
|e| lfTT1l . Loh, Zhou, and Bruck looked at generating 2-state 
probabilities given pswitches with errors. They found that 
any binary distribution generated according to their algorithm, 
regardless of size, had error bounded by 2e. They also showed 
similar bounds for different distributions. 

We examine the same problem in the context of multivalued 
distribution generation and show that a generalized result holds 
for any number of states. Define the error of a multivalued 
distribution as the largest error over all the states. That is, if 
a pswitch with desired distribution (po>Pi> —1PN-1) has an 
actual distribution of (p + e ,Pi +6i, ...,pn-i +e.zv— i)> tnen 
the error of the switch is maxj |e,|. 

We will begin by demonstrating robustness for TV-state 
binary distributions generated according to the algorithm in 
section IV. This algorithm uses switches from the switch set S 
= {|} as well as the deterministic switches. For our analysis, 
we allow errors on the active states of pswitches. As a result, 
deterministic switches have no errors since the sum of the 
single active probability must equal 1. The i -pswitch has 
distribution (~ + e, 0, 0, | — e), |e| < e. 

Lemma 4 (Error Bound on Boundary States): Generate 
any distribution (..., 0, 0, ...) according to the 

binary TV-state algorithm where state i is the smallest active 
state and state j is the largest active state. If we allow at most 
e error on the pswitches in the switch set, then the actual 
distribution generated will be (..., 0, fi +Si, §J 0, ...) 
where |<S*| < 2e, \6j\ < 2e. 

Proof: In each step of the algorithm, a distribution 
r = (..., 7"j, rj, ...) is made out of 2 pswitches p = 



(...,Pi,...,p k ,...) and q= (..., q k , qj, ...) where i < j < k. 
The composition is in the form r = p + (|, ...,\)q. We will 
prove robustness using induction on switches p and q. 

Base Case: We use the deterministic switches as our base 
case switches (for these, i = k or k = j). These trivially 
satisfy the induction hypothesis since they have no error. 

Inductive Step: Assume we are given p and q satisfying 
the inductive hypothesis. That is, for p = (...,Pi + Aj, ...,Pk + 
A fc , ...), q = {...,q k + S k , ...,<7j+<5j, ...), |Ai|, |A fc |, \S k \, \6j\ < 
2e. Then the errors for states i and j on distribution r = 
p + (| + e, \ — e)q can be calculated as follows: 

|error j;<fc | = |(i + e)(pi + A 4 ) - \^p l \ 

= l^Ai + eipi + AJl 

< ilAil + lcllpj + Ail 

< 2e 

|error 4=fc | = |(I + g) + (i - e)(q t + Si) - (i + l -q % )\ 

= \\&i + K^-ifh + Si))\ 

<^W + |e||l-(ffi + «i)| 

< 2e 

|error J>fc | = |(i - e)(^ + 6j) - 

= + i( qj + s,)\ 

< + \e\\q 3 + S 3 \ 

< 2e 

|error J=fc | = |(i - e) + (- + e){ Pj + A,) - (i + - Pi )| 
= |iA i + e(l-( Pi +A i ))| 

< \\^ 3 \ + \m-(p 3 + ^)\ 

< 2e 

Then we find that the errors for states i and j still satisfy the 
inductive hypothesis, so we are done. ■ 

Theorem 6 (Robustness of Binary N -state Distributions): 
Generate any distribution ) according to the 

binary A^-state algorithm. If we allow the pswitches in the 
switch set S = | to have up to e error in the active states, then 
the actual distribution (fa + S , fi + <*i, ^f^ 1 + 6n-i) 
has errors < 3e, \Sq\, |5jv-i| < 2e. 

Proof: In each step of the binary A-state algorithm, a 
distribution r = (..., r^, r 3 ;, ...) is made out of 2 pswitches 
P = (...,Pi,...,Pfc,...) and q = (...,q k ,...,qj,...) where i < 
j < k, i is the smallest active state and j is the largest active 
state. The composition is in the form r=p+(|,...,|)g | . We 
will prove robustness using induction on switches p and q. 

Base Case: We use the deterministic switches as our base 
case switches. These trivially satisfy the induction hypothesis 



since they have no error. 

Inductive Step: Assume we are given p and q satisfying the 
inductive hypothesis. That is, for p = (...,Pi + Ai,...,p k + 
A fc ,...), q = {...,q k + S k ,..., qj + Sj,...), \A s \,\6 a \ < 3e, 
| Ai\, | Afe|, \8f.\, \Sj\ < 2e. Then the errors for states s on 
distribution r=p+(| + e,...,| — e)q can be calculated as 
follows: 

1) For s = i,j, we apply the previous lemma on boundary 
states 

2) For i < s < k,k < s < j, we have a similar expression 
as calculated in the proof for boundary error (steps 
skipped below since shown earlier). 

|error j;<s<fe | < ~|A a | + \e\\p s + A s \ 

< -e < 3e 
~ 2 ~ 

|error fc<s<J -| < -\S a \ + \e\\q s + S s \ 

3) For s = k, we have the following error: 

|error s=fe | = |(~ + e)(p k + A k ) + h~ e)(«fc + 4) 

= \^A k + ^S k + e[( Pk + A k ) - (q k + S k )}\ 

<l(|A fc | + |4|)+e 
< 3e 

Then we find that the errors for all states s still satisfy the 
inductive hypothesis, so for i = and j = N — 1, we are 
done. ■ 

Now we will show that the iV-state rational distributions 
generated according to the algorithms in Section V are also 
robust. For the state reduction algorithm, all the results are 
identical to the binary robustness results since the proofs did 
not assume that p and q were binary distributions. However, 
since the state reduction algorithm depends on other unknown 
2-state algorithms, the error will depend on the errors gener- 
ated from the 2-state algorithm. The robustness result for the 
denominator reduction algorithm is very similar to the one for 
binary distributions. The main difference is that we have a 
different switch set S = {|, |, |}. 

Theorem 7 (Robustness of Rational Distributions): 
Generate any distribution (ph ^k> ■■■■> XN q ^ 1 ) according 
to the denominator reduction algorithm. If we allow the 
ps witches in the switch set S = {5, •••>!} to have up 
to e error in the active states, then the actual distribution 
(^+4,fi+'5i,-,^ i +^-i)has errors |^| < (q + l)e, 

\M, \§n-i\ < qt- 

Proof: For generating rational distributions, each iteration 



repeats compositions of the form: 

(z , ...,z N -i) = (p , ...,p tl ,0, ...) 

+ ^(■■■,o,qi 1 ,-,qi 2 ,°> ■■•) 

+ ^(-,0,r i2 ,...,r i3 ,0, ...) 

+ ... 

1, 

+ -(■■■, 0,y ig _ i; ...,y iq ,0, ...) 

We notice that each composition of this form is the same 
as q — 1 compositions of the form (...,f>j, Pk, •••) + 
|(..., <?fe, <7j, •••) where q — 2,3, ...q. Then from here, the 
work is almost identical to the proof of the binary distributions, 
so we will only outline the steps. 

1) Prove that the boundary states of distributions formed 
from (..., Pi ,...,p k ,...) + i(...,q k ,...,qj,...) composi- 
tions, q £ {2, 3, q} have error of at most qe regardless 
of the number of compositions (use induction). As 
before, we allow pswitches (^j^ + e, | — e) to have 
error e < e. 

2) Using the boundary state error, prove that the errors of 
all states have error of at most (q — l)e. 

Then, since all the compositions making up the rational distri- 
butions are of the form (...,p it ...,Pk, •••) + f(-, qk, 
the steps above are sufficient for proving our theorem. ■ 

VII. Universal Probability Generator 

Up till now, we have only looked at circuits with set 
switches and described algorithms for realizing specific proba- 
bility distributions. A next question is: What about circuits that 
implement stochastic functions? That is, how can we reason 
about circuits that implement different probability distributions 
given 'input' switches? Wilhelm and Bruck approached this 
problem for the 2-state version[9) by constructing a circuit 
which they called a Universal Probability Generator (UPG). 
The function of this circuit was to map n-deterministic bits 
into output probabilities of the form |£ in increasing order. 

This functionality alone is not surprising since it can easily 
be done with an exponential number of switches using a 
tree-like structure; however, the remarkable result is that the 
UPG only requires a linear number of switches in n. In this 
section we will show a generalization of this circuit which 
is able to realize any binary probability distribution of the 
form (f£, §k, 1 ) usm S a number of switches that is 

polynomial in n with degree N — 1. 

The strategy we will take for deriving the UPG construction 
is first to ask: 'How should the UPG function?' Then we will 
build a circuit that naively implements the functionality and 
use algebraic rules to reduce the complexity of the circuit. 

A. 2-state UPG 

We will first review the 2-state UPG of Wilhelm and 
Bruck[9|, deriving the results in a manner that is similar to 
the steps we take to get the generalized A^-state UPG. 



Definition: (2-state UPG). A 2-state UPG U n is a circuit 
that realizes distributions of the form (§£,§£) using n + 1 
input bits which we will refer to as ro, ri, r n . When the 
input bits, in the order ro,r„,r„_i, r2,ri are set to the 
binary representation of §£, then the circuit f/„ will realize 
distribution |^). In other words, to realize any desired 
binary probability (p-, we set the input bits to the state 
probability. 

As an example, we look at the input-output mappings for 
the circuit U3. If we input r = 0001, the circuit will realize 
(|, I) since § = 0.001 2 . The input 0101 will realize (§ 



since | = O.IOI2 (see Figure 10 1) 



5 ) 
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The motivation for the UPG circuit comes from an interest- 
ing property in the truth table. For example, let us enumerate 



all the outputs for U3 (Fig. 10 1). For each row (input), we 
ask the following questions: What would the output of U2 be 
given the inputs ro,r2,ri that were used for U3I Is there a 
relationship to the construction of the U3 output probability? 

If we calculate these outputs, we find that they are the 
same probability distributions used in the binary algorithm for 



2-states (Fig. 10 tc). From here, the (exponential) recursive 
construction is straightforward. 

Lemma 5: A 2-state UPG U n with inputs ro, r n , . . . , T2,ri 
can be constructed with an exponential number of switches 
using the recursive construction in Figure 11 where the n 
bits used in f/„_i are r , r„_i, . . . , r 2 , T\. 
Proof: We can prove this inductively. 

Base Case: Uq = fo realizes (0, 1) and (1, 0) when ro = 
and ?'o = 1 respectively. 

Inductive Step: Let ro n = ro + r n , which also implies fo„ = 
fof n . Assume U n -\ realizes the correct distributions given the 
defined inputs. Then we want to show that 



U n — L n + (-, -)R n , 



L„ = r Qn ■ + r Qn U n -i, 

Rn = r QnU n -l + fan ' 1, 
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Fig. 11. 2-state exponential UPG This is a UPG derived directly from the 
B-algorithm. It uses an exponential number of switches since U n uses two 
copies of C/ n _ 1 in its recursive construction. 



This is equivalent to showing, 
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which is equivalent to showing, 
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Here, we are reminded of the B-algorithm (for realizing any 
2-state binary distribution): 
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Then we are almost done with our proof. We know that ro„ 



1 ^ > I and f „ = 1 
inputs to U n _i, we get 



,„ < %. In addition, if 



r is set to generate |^), then using ro,r n _i, ...,r 2 ,ri as 
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Fig. 10. 2-state Universal Probability Generator (UPG) mappings (a) 

A mapping for a UPG that generates distributions of the form (^p, ^r)- (b) 
Removing rg, we look at the inputs to a UPG that generates distributions 
of the form (^p, ^). (c) Notice that the B-algorithm decompositions of the 
probabilities of part (a) correspond to those in part (b). 
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This is because when r = 0, removing r„ is like shifting the 
fractional bits left (multiplying by 2) and then subtracting 1 
if the bit removed was 1. When ro = 1, then removing r n 
doesn't change anything, which still satisfies 1 = 1(2) — 1. At 
this point, we can invoke the B-algorithm to conclude that U n 
will successfully realize |^). ■ 

Lemma 6: A 2-state UPG U n with inputs ro,r„, . . . , T2, t\ 
can be constructed with a linear number of switches using 



either of the two recursive constructions in Figure 12 where 
the n bits used in U n -x are ro, r n _i, . . . , r2, T\. 

Proof: We will prove this by algebraically reducing the 
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Fig. 12. 2-state UPG reduced After reducing the exponential UPG, we get 
two linear UPGs. One is a sp-circuit and uses 2 stochastic switches. The other 
is non-sp and uses 1 stochastic switch. 
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Fig. 14. 2-state UPG removed bit Even though ro was important in 
controlling the cases of our algorithm, it turns out we can remove it from 
the recursive construction and just append it at the end. Note that U' n only 
uses bits r n , ■ ■ ■ , ra, T\ 



exponential UPG. Let p- t = (|, |) be i.i.d. 

U n = L„ + p n R n 

= r 0n ■ + r 0n U n -i +p n (ronU n -i + fon ■ 1) 
= r 0n U n -i + p n (r 0n U n -i + f 0n ) 

= U n -l(f 0n + r nPn) + f 0n Pn 
= Un-l(r n + Pn) + f 0n Pn 

This is exactly the form of the series parallel construction in 



Fig. 12 



U n — L n + p n R n 

= r 0n ■ + f 0n U n -i +p n (r Qn U n -i + fon • 1) 

= r 0n U n -l + Pn(ro n U n -i + f 0n ) 

= [J7„_l][f 0n ] + [ron] [ron] [ron] 

n] [^On] [Pn] 

This is exactly the form of the non-series parallel construction 



in Fig. 12 



The result thus far is nice, but one feels somewhat unsatisfied 
at the number of times ro must be used. It is strange that ro 
is used so many times in the circuit construction even though 
it is only set to 1 for a single distribution in the truth table - 
the deterministic distribution (1,0). We solve this problem to 
get to the final form of Wilhelm and Bruck's 2-state UPG. 

Theorem 8: A 2-state UPG U n with inputs 
ro, r n , . . . , T2, 7*i can be constructed with a linear 
number of switches using either of the two recursive 



constructions in Figure 14 where the n bits used in f/ n _i are 
r ,r n _i, . . .,r 2 ,r 1 . 



Proof: We prove that the circuits in Figure 12 and Figure 
14 are equivalent by using induction. Base Case: Uq = tq = 

Inductive Step: Assume that U n -x = roU^ i _ 1 . Then, 

U n = U n -l(f n fo + Pn) + P n r n fo 

= roUn-i(r n ro + Pn) + p n f n fo 
= r Q (U' n _ x {f n r Q + p n ) +p n f n ) 

= 7-o(C^-l( f n +Pn) +Pnf n ) 

= r U' n 



1 with \ 



In this final form, the series -parallel circuit uses 2n 
pswitches and 2n + 1 deterministic switches. The non-sp con- 
struction uses n pswitches and 'in + 1 deterministic switches. 

Before we can generalize to N states, we need to define 
some new notation and equalities. Define the 2-state UPG 
Uu ,4+1] lTi to be a circuit generating probability |g- on state 
i and on s tate i + 1. 

Under this notation, the previous results we proved were 
for C/[o,i],n- We can extend these results to f/r^j+iin with the 
following changes: 

1) The input bits r will take on values i and i + 1 instead 
of and 1 respectively 

2) The pswitch p n will take on values i and i 
probability each instead of taking on values 0, 1 with 
equal probability. 

But what if we didn't use values i and i + 1 for r,p n 7 

Lemma 7: Let f/fj^+ii n (a, b) be a circuit identical to 
U[i i+i] in except that r, p n take on values a and b. Our previous 
results for probability generation are for f7[i,j+i] )n (M + 1). 
Then if a < i, b > i + 1, 

U[ iti+ i], n (i,i+ 1) = (i + f7[ i)i +i],„(a,6))(i + 1) 

Proof: E/uj+ii^a, b) will realize the desired distribution 
on states a and b. Then if a < i, b > i + 1, it is trivially that 
taking the max of i and the min of i + 1 will give us the same 
distribution on states i and i + 1. ■ 
We want to avoid this messy notation for future generaliza- 



tions. All future instances of U\ 



[z,i+l] ,n 



are actually represent- 



ing (i+£7[ iii+ i] in )(i+l). In other words, £/[ M +i],„ will always 
generate the 'correct' distribution on states i and i + 1 as long 
as r,p n take on values a < i and b > i + 1. 

B. 3-state UPG 

Now we will derive the 3-state UPG by generalizing the 
steps we used in the 2-state UPG derivation. We first construct 
an exponential 3-state UPG that follows closely from the 
algorithm for realizing 3-state distributions. Then, we will 
algebraically reduce it to a quadratic construction and remove 
unnecessarily repetitive bits. 

Definition: (3-state UPG). A 3-state UPG f/[o,2],n is a circuit 
that generates distributions of the form ( |£ , |£ , ||) using 2 
sets of n+ 1 input bits r, s. These input bits will take on values 



or 2; when the bits r, in the order r ,r n ,r n _ 1 , . . . ,r 2 ,ri 
are set to the binary representation of |£ and the bits s, 
in the order sq, s n , s n _i, . . . , S2, Si are set to the binary 
representation of |£ + |£ (with the symbol 2 replacing the 
boolean 1), then the circuit f7rn,2],n will realize distribution 
(fs*) fffj fs^)- In other words, to realize any desired binary 
probability (p-, p.), we set the input bits r and s to po 
and po + pi respectively. 

As an example, we look at the input-output mappings for the 
circuit f7[o,2],2- If we input r = 002, s = 020, the circuit will 
realize (|, '\, §) since \ = 0.01 2 and \ + \ = \ 

0.1 Oj 



0.10 2 . The 

input r = 020, s = 022 will realize (~, |, ~) since | 
| = |= 0.11 2 . (see Figure 13 i). 



and I 

Again, the motivation for the exponential UPG comes 
from an interesting property in the truth table of C/[o,2],2- 
We first eumerate all outputs given inputs corresponding to 
valid probability distributions. For each row (input), we ask: 
What are the outputs of E/[o,i],i) ^[i,2],i> ^[0,2], 1 if we use 
the inputs ro,ri for f7ro,i],i, the inputs so>si f° r f[i,2],i> 
and ro,ri, so, si for <7[o j2 i 2 ? is there a relationship to the 
construction of the ?7ro,2],i output probability. 

The answer is yes. We find that they are the same probability 
distributions that are used in the binary algorithm for 3-states 



(Fig. 13 3c). We can now derive the exponential recursive 
construction. 



Lemma 8: A 3-state UPG Um >2 ],n with inputs 
r ,r n , . . . ,r 2 ,ri and s , s n , . . . , s 2 , Si can be constructed 
with an exponential number of switches using the recursive 



construction in Figure 15 where the bits used in f7[o,i],n-i 
are r , r„_i, . . . , r 2 , ri, the bits used in f7[i,2], n -i are 
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Fig. 15. 3-state exponential UPG We design a UPG directly from the 
algorithm. Note that this uses an exponential number of switches since the 
two copies of C^[o,2],n-l- 



recursion uses t 



So, s n -i, . . . , s 2 , Si, and the bits used in £/[o,2],n-i are 
ro,r n -i,...,r 2 ,ri and s , s„_i, . . . ,s 2 , s x . 
Proof: We can prove this inductively. 

Base Case: t/[o,2].o = ^o(so + 1) realizes (1,0,0) when 
r = 2, (0,1,0) when r = 0, s = 2, and (0,0,1) when 
r = 0, s = 0. 

Inductive Step: Assume (7[o,2].n-i realizes the correct dis- 
tributions given the defined inputs. Then we want to show that 

U[0,2],n = £[0,2],n + (^'O' 2^[°< 2 ]<™' 

L[0,2],n = r 0n - + f „(s „[/[o,l],n-l + S0n^[0,2],n-l)i 
R[0,2],n = ^0n^[0,2],n-l + ^0n (son C^[l,2] ,n- 1 + «0n 1 2) 
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Fig. 13. 3-state UPG mapping The mapping for a UPG that generates distributions of the form ^). Here we notice that j/q is being used a 

couple of times. Again, we don't make predictions about inputs that don't correspond to a probability distribution: this includes both those with a sum greater 
than 1 and those for which the y encoding is less than the x encoding (which would mean a negative 2JL value). 
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with a quadratic number of switches using either of the two 
recursive constructions in Figure 16 where the bits used in 
^[o,i], n-i are r , fn— l, ■ • ■ ) 7*2) 7*1) the bits used in U[i,2],n— l 
are so, Sn-i) ■ • ■ ; s 2, si, and the bits used in t/[o,2],n-i 
ro,r n _i,...,r 2 ,r 1 and s , s„_i, . . . , s 2 , si. 

Proof: We will prove this by algebraically reducing the 
exponential UPG. Let p.; = (g,0, |) be i.i.d. We also introduce 
some new variables to make the final expressions concise. 
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A visual of these is in Figure 16 

that i?[0,2],n = r VnU[Q } 2],n-l + ^0n~R 

similarly except that the J7[ 2 ], ra 
a 2. We notice that, 



^[0,2], 



is defined so 
■ , ., . LL 01 is defined 
! component is replaced by 



Fig. 16. 3-state UPG reduced After reducing the exponential UPG, we get 
two linear UPGs. One is a sp-circuit and uses 2 stochastic switches. The other 
is non-sp and uses 1 stochastic switch. 
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which is equivalent to showing, 

f (l ! 0,0) + (i ! 0,i)C/ [0 ,2],n-i, if r 0n = 2 

U[0,2],n = < ^[0,l],n-l + ( 2 >0, 5 ) U[x,2] ,n- 1 , if ^Ori = 2, S n 

lu'[o,2] 1 »-i + (2,0,|)(0,0,l), if f 0n = 2,s 0n 

Here, we are reminded of the algorithm for realizing any 3- 
state binary distribution: (f^, fi, fj) = 
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From here, it is straightforward to complete our proof. 



ron = 2 

SQn = 2 - 



= 2 



^ J" — 2' 0,1 

> 1, and s „ = 2 
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0^1 ^ f 

2™ ^2" 

then 



^[0,2],n = ^[0,2],n + Pn-%),2],n 

— ■t'[0,2],nt'[0,2],n— 1 +Pn"[0,2],n 
= (l"0n ' + n3n£[o,2],n)^[0,2],n-l 

+ Pn(n>n ^[0,2],n-l + ^0n-R[0,2],n) 
= ^On^[o,2],n^[0,2],n-l + Pn( r 0nU[0,2],n-l + ^0n-R[0,2], 
= ^[0,2],n-l(ro n i[0,2],n + Priori) + Pn f On-R[o,2],n 

This is exactly the form of the series parallel construction in 
Figure [16] 

U[0,2],n = L[o,2],n + PnR[0,2],n 

= -^[O^J.n^O^J.n-l +Pn-R[0,2],n 
= 2 = ( r °» ' + ^On£[o,2],n) t/ [0,2],n-l 

= 2 +Pn(?*0nt r [0 ) 2],n-1 + f On^[o,2],n) 

= ^Oni[o,2],ri^[0,2],n-l + (^0nC^[0,2] ,n- 1 + ^0n-R[0,2], 
= [^[0,2],n-l]fon£p,2],J + [^On-R[o,2],rJ [ r 0«] Nn^[o,2] 
,2],n— l] [''On] [Pn] + [''On-Rjo^] ,n] [^"] 

This is exactly the form of the non-series parallel construction 
in Figure [16] ■ 
Finally, as in the 2-state UPG, we will remove the repetitive 
instances of ro and sq. 

Theorem 9: A 3-state UPG C^[o,2],n with inputs 
ro, r n , . . . , r 2 , r± and s , s «, ■ • • , s 2, s i can b e constructed 
with a quadratic number of switches using either of the two 



In addition, if r, s are set to generate (fs-,f^'f" 
using r ,r n _i, ... ,r 2 ,ri and s , s„_i, . . . ,s 2 , Si as inputs to 
^[0,i],n-i» ^[i,2],n-i and U[ 0i2 ],n-l we get the corresponding 
values to the 3-state binary algorithm. Then we can conclude 
that [/[o, 2 ], n w iU successfully realize fi, f^)- ■ 
Lemma 9: A 3-state UPG t/[o,2],n with inputs 
ro, r n , . . . , r 2 , ri and so, s n , . . . , s 2 , Si can be constructed 



recursive constructions in Figure 17 where the bits used in 

^[o,i],n-i are r , ?*n— i, • • ■ ,^2,^1? the bits used in U[i : 2],n~ l 
are so, s„_i, . . . , s 2 , si, and the bits used in C/[o j2 ],n-i are 
?'o,?'n-i,---,? , 2,? , i and s ,s„_ 1 ,...,s 2 ,si. 

Proof: We will prove this by induction. Assume 

^[0,2], n -i = ^o(so + i)^^],^-!- Then ' b y breaking up the 
circuit into cases based on ro and so, we will show that 

U[ ,2],n = r Q (s + l)U' M>n . 
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Fig. 17. 3-state UPG removed bit Even though ro and so were important 
in controlling the cases of our algorithm, it turns out we can remove it from 
the recursive construction and just append it at the end. 



Case 0: r = 2, s = 2. In this case, we know that 
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Case 2: rp = 0, sq = 0. In this case, we know that 
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Sor, 



f (5o + 1) = 2 
Therefore, f (s + l)^(o,2],n = Pp.q.n = ^[o.aj.n. 
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Fig. 18. TV-state UPG The TV-state UPG is almost exactly the same as the 
3-state UPG. 



Finally, as in the 2-state case, the 3-state UPG E/kj+2], tj will 
generate the appropriate distributions over the states i, i+1, i+ 
2. 

C. N -state UPG 

The results will now be generalized to iV-states. The steps 
and the proof are all identical to the 3-state version except 
that the variables L and R are defined to be generalized 
constructions. Since they are identical, we will define the 
TV-state UPG and then jump to the conclusion and circuit 
diagrams. 

Definition (N -state UPG): A iV-state UPG £7[o,iv-i],n 
is a circuit that generates distributions of the form 



(§£, fs-,..., ^fF 1 ) using iV-1 input vectors r ,ri, r N _ 2 , 
where each vector ri has n + 1 bits r^, r,-i, fi n . When the 
input vectors n = (r i0 , r in , r i(n _i) , . . . , r, 2 , Ux) are set to the 
binary representation of go+g^t^+gi w j m j ne symbol iV — 1 
replacing the boolean 1, then the circuit I/m jv— i], n w ^ realize 



distribution (§£, f^- 



.,„ =•). In other words, to generate any 
desired binary distribution (p-, x ^ ~ 1 ), we set the input 

vector ro to §£, the input vector ri to the sum x ° 2 t 3:i , . . ., and 
the input vector tn-2 to the sum — — i — ^ — ^^-A 

Theorem 10: A iV-state UPG f7[o,jv-i],n with inputs r-j, < 
i < N — 2 can be constructed with a polynomial num- 
ber of switches 0(n *) using either of the two recursive 



constructions in Figure 18 where the bits used in C/ro.-f],**— i 
are ro, rj, . . . , ri_i andthe bits used in f/fijjv-ii.n— l Slle 
ri,r i+ i,...,r N _ 2 - 

VIII. Partial Orders 

All current work has been done on states that are a total 
order, i.e. < 1 < ... < n — 1. It is interesting to think 
about a logic on states that are in a partial order. We perform 
a cursory examination of constructing probabilities on lattices 
that are partial orders and find that we cannot generate many 
distributions on partial orders. 

Composition rules must be generalized again since you 
cannot take a max or min of incomparable states. Instead of 
max and min, we use the V (join) and A (meet) operators 
respectively. 

Theorem 11 (Partial Order Inexpressibility): For the lattice 
in Figure 19 3, no distributions of the form v = (0, 1 — p,p, 0), 



where p > are realizable by building a sp circuit with any 
switch set unless (0, 1 —p,p,0) itself is in the switch set. 

Proof: Assume that v is realizable by a series composi- 
tion. That is, v — xy. Then, 

woo = 2:00 + yoo - £002/00 + £012/10 + £102/01 

=> x 00 = and y o = 
wii = 

=>■ xn = or xn = 

WLOG, we let x n = 0, so that, 

"io = £102/11 + £112/10 + £102/10 

= £102/11 + £102/10 

= £10(2/11 + 2/io) = P 

=>x 10 ^0 
Vol = £012/11 + £112/01 + £012/01 

= £012/11 + £012/01 

= £01(2/11 +2/01) = 1 - p 

=> £01 + 
"00 = £012/10 + £102/01 

=> ?/io = and y 01 = 

=> 2/11 = 1 
=> x = v 




roo=xaoyoo+xaoyoi+xoiyoo+moyio 

+Xioyoo+XDoyii+Xiiyoo+XLoyoi+Xoiyio 

rai—XDiyai+xaiyix+xiiyxii 

n = xi oyi o+xi oyi i+xi iyi 

ni=xii>ii 



Fig. 19. A simple partially ordered lattice. This is an example of a simple 
partial order, (a) An example of a series composition (meet) of switches x 
and y. (b) The lattice structure. 



But then we have x = v, so we have shown that need without 
v itself, we cannot realize v through a series connection. In 
the same way, we can show the same result for a parallel 
composition (left out since it is almost identical), which means 
that v is not realizable with a sp circuit. ■ 

IX. Applications 

One intuitive understanding of this multivalued alphabet 
is to think of timings and dependencies. Switches represent 
events, and the states represent the discrete time when the 
event occurs. When two switches x and y are composed to 
make the circuit z, we are saying that for event z to occur, 
it depends on events x and y. If event z requires both x and 
y to occur before it can occur, then this can be represented 
by a parallel connection since the time z occurs will be the 
m&x(x,y). If event z only needs either x or y to occur, then 
this can be represented by a series connection since the time 
z occurs will be the min(x, j/). 

It is possible to implement multivalued relay circuits using 
physical (open and closed) relay switches. If we consider 
switches as normally closed and the state as the time it 
is opened, then we have a direct correspondence to the 
description above. When switches are composed in series, 
the time that the entire circuit is open, is when either of the 
switches open (min) and when the switches are composed in 
parallel, the time that the entire circuit is open is when both 
the switches are open (max). 

We can imagine a biological neural circuit working in 
a very similar way. Consider a network of neurons where 
each neuron only has incoming signals from 2 other neurons. 
Then, consider a neuron x when incoming neurons y and 
z. Neuron x will fires when the incoming signals exceeds 
a certain threshold potential. If the threshold is low, we can 
imagine that neuron x will fire once it receives the first 
incoming signal (i.e. min(x, y)). If the threshold is high, we 
can imagine that neuron x will only fire if it receives signals 
from both incoming neurons (i.e. max(i, y)). In general, with 
m incoming neuron signals, it becomes slightly more complex; 
depending on the threshold of the neuron of interest, the time 
it fires would be when it receives the 1st, 2nd, 3rd, mth 
signal. These gates can be implemented by max/min gates. 

There are a number of directions for extending this work to 
better model and understand biological circuits. These include 
studying joint distributions, stochastic functions, probability 



distributions on different state structures, and stochasticity 
in compositions. With further understanding of biological 
circuits, this can also aid in building artificial molecular 
circuits, such as in DNA computing! 17]. In addition, stochastic 
circuits may be useful in modeling other stochastic networks 
in engineering. 

X. Conclusion 

In this paper, we studied probability generation in the 
context of multivalued relay circuits, a generalization of two- 
state relays. We proved a duality result on this max/min 
generalization and then proved construction algorithms for 
generating any rational probability distribution. We extended 
the robustness result to these algorithms and showed that 
switch error remains bounded linearly regardless of the circuit 
size. Finally, we constructed a universal probability generator 
for mapping deterministic inputs to probability distributions 
and demonstrated a basic non-realizability result for partial 
orders. 

Further work on multivalued stochastic switches may have 
many biological applications such as for neural coding. 
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